CDS

Accession Number TCMCG075C07536
gbkey CDS
Protein Id XP_017971719.1
Location complement(join(36335907..36335916,36336076..36336222,36336310..36336435,36337069..36337160,36339156..36339241,36339329..36339439,36339558..36339594))
Gene LOC18609860
GeneID 18609860
Organism Theobroma cacao
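
The Location field above lists seven exon intervals on the complement strand. As a minimal sketch (the helper names `parse_location` and `cds_length` are illustrative, not part of any database tooling), the intervals can be parsed and summed to check that the spliced CDS length agrees with the 202 aa protein length reported in this record:

```python
import re

# Location string copied verbatim from the record above.
LOCATION = (
    "complement(join(36335907..36335916,36336076..36336222,"
    "36336310..36336435,36337069..36337160,36339156..36339241,"
    "36339329..36339439,36339558..36339594))"
)


def parse_location(location):
    """Return (strand, exons); exons are 1-based inclusive (start, end) pairs."""
    strand = "-" if location.startswith("complement(") else "+"
    exons = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, exons


def cds_length(exons):
    """Total spliced length in nucleotides (intervals are inclusive)."""
    return sum(end - start + 1 for start, end in exons)


strand, exons = parse_location(LOCATION)
total = cds_length(exons)
print(strand, len(exons), total)  # - 7 609
print(total // 3 - 1)             # 202 codons, excluding the stop codon
```

The 609 nt total corresponds to 203 codons, that is, 202 amino acids plus one stop codon.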

Protein

Length 202aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018116230.1
Definition PREDICTED: probable peroxygenase 4 isoform X3 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description peroxygenase
KEGG_TC -
KEGG_Module -
KEGG_Reaction R09462
R09463
KEGG_rclass RC01711
BRITE ko00000
ko00001
ko01000
KEGG_ko ko:K17991
EC 1.11.2.3
KEGG_Pathway ko00073
map00073
GOs GO:0001101
GO:0003674
GO:0003824
GO:0006950
GO:0008150
GO:0009414
GO:0009415
GO:0009628
GO:0009719
GO:0009725
GO:0009737
GO:0009819
GO:0010033
GO:0010035
GO:0016298
GO:0016787
GO:0016788
GO:0033993
GO:0042221
GO:0050896
GO:0097305
GO:1901700

Sequence

CDS:  
ATGGCTTCGTCTTTGTCATCAAAAGATTTCCAAGAGGGAGTAGATGAAGGGAGACCTATTCCTACTGACCAGAACGTTCTGCAAAAGCATGTTGCGTTCTTTGACAGGAACCATGATGGCCTCATTTATCCATGGGAGACTTTTGAAGGTTTTCGAGCGATTGGGGCTGGTTATATGTTGTCAATTGCCAGTGCTATCTTAATCAATATTGCTCTCAGTCGCAAAACTCGCGCTGGGAAATTTTCTTTGCTCTTCCCCATTGAGGTTAAAAACATTCACAAGGCAAAACATGGGAGTGACTCTGGCGTATACGACAGTGAAGGAAGGTTTGTGCCCTTGAAGTTTGAAGAAATCTTCAGCAAATTCGCCAGGACACATTCCAATGCCTTAACATCTGATGAACTGAAGGCAATGCTCAAGGCTAATAGGGAACCACAGGATTACAAAGGATGGATTGGTAGCTGGACAGAATGGAGGACTTTATACAGTCTATGCAAGGACAAACATGGTTTATTGCGTAAAGAAACAATTAGAGCTGTTTATGATGGAAGCCTCTTTGAACATATGGAGAGGGAAAGAGCAGCTGCCAAAAAGAAAGCCTGTATGTAA
Protein:  
MASSLSSKDFQEGVDEGRPIPTDQNVLQKHVAFFDRNHDGLIYPWETFEGFRAIGAGYMLSIASAILINIALSRKTRAGKFSLLFPIEVKNIHKAKHGSDSGVYDSEGRFVPLKFEEIFSKFARTHSNALTSDELKAMLKANREPQDYKGWIGSWTEWRTLYSLCKDKHGLLRKETIRAVYDGSLFEHMERERAAAKKKACM
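
The CDS and Protein sequences above are related by the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code applies to this plant gene; `translate`, `cds_start`, and `cds_end` are illustrative names, and only the first 30 and last 12 nucleotides of the CDS are used to keep the example short:

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS listed above.
cds_start = "ATGGCTTCGTCTTTGTCATCAAAAGATTTC"
cds_end = "GCCTGTATGTAA"

print(translate(cds_start))  # MASSLSSKDF
print(translate(cds_end))    # ACM
```

Both fragments match the start (MASSLSSKDF) and end (ACM) of the protein sequence in this record; passing the full CDS string to `translate` would be the natural way to check the whole sequence.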